INVESTIGATION 



A Strategy to Identify Dominant Point Mutant 
Modifiers of a Quantitative Trait 

William F. Dove,*'* 4 Alexandra Shedlovsky,* Linda Clipson,* James M. Amos-Landgraf,*' 1 
Richard B. Halberg,* 2 Kathleen J. Krentz,*' 3 Frederick J. Boehm,* Michael A. Newton,* § 
David J. Adams,** and Thomas M. Keane** 

*McArdle Laboratory for Cancer Research, Department of Oncology, laboratory of Genetics, ^Department of Statistics, 
and §Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, 
and **Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK 



ABSTRACT A central goal in the analysis of complex traits is to identify genes that modify a phenotype. KEYWORDS 
Modifiers of a cancer phenotype may act either intrinsically or extrinsically on the salient cell lineage. intestinal 
Germline point mutagenesis by ethylnitrosourea can provide alleles for a gene of interest that include loss-, neoplasia 
gain-, or alteration-of-function. Unlike strain polymorphisms, point mutations with heterozygous quantitative genetic modifiers 
phenotypes are detectable in both essential and nonessential genes and are unlinked from other variants ENU 
that might confound their identification and analysis. This report analyzes strategies seeking quantitative mutagenesis 
mutational modifiers of Apc Mln in the mouse. To identify a quantitative modifier of a phenotype of interest, isogenicity 
a cluster of test progeny is needed. The cluster size can be increased as necessary for statistical significance 
if the founder is a male whose sperm is cryopreserved. A second critical element in this identification is 
a mapping panel free of polymorphic modifiers of the phenotype, to enable low-resolution mapping 
followed by targeted resequencing to identify the causative mutation. Here, we describe the development 
of a panel of six "isogenic mapping partner lines" for C57BL/6J, carrying single-nucleotide markers in- 
troduced by mutagenesis. One such derivative, B6.SNVg, shown to be phenotypically neutral in combina- 
tion with Apc Min , is an appropriate mapping partner to locate induced mutant modifiers of the Apc Min 
phenotype. The evolved strategy can complement four current major initiatives in the genetic analysis of 
complex systems: the Genome-wide Association Study; the Collaborative Cross; the Knockout Mouse 
Project; and The Cancer Genome Atlas. 



Copyright © 2014 Dove et al. 
doi: 10.1534/g3.114.010595 

Manuscript received February 11, 2014; accepted for publication April 8, 2014; 
published Early Online April 17, 2014. 

This is an open-access article distributed under the terms of the Creative 
Commons Attribution Unported License {http://creativecommons.org/iicenses/ 
by/3.0/), which permits unrestricted use, distribution, and reproduction in any 
medium, provided the original work is properly cited. 

Supporting information is available online at http://www.g3journal.org/lookup/ 
suppl/doi:1 0.1 534/g3.1 1 4.01 0595/-/DC1 

Present address: Department of Veterinary Pathobiology, University of Missouri, 
Columbia, MO. 

2 Present address: Department of Medicine, University of Wisconsin - Madison, 
Madison, Wl. 

3 Present address: Transgenic Mouse Facility, Biotechnology Center, University 
of Wisconsin - Madison, Madison, Wl. 

Corresponding author: 1400 University Avenue, Madison, Wl 53706. 
E-mail: dove@oncology.wisc.edu 



Genetic analysis of a phenotype in a multicellular organism starts with 
the discovery of those genes that influence the process in the whole 
organism. Loss-of-function alleles provide the least ambiguous route to 
understand the role, direct or indirect, of the functional wild-type allele 
in the process of interest. Gain-of-function and neomorphic alleles can 
provide further functional information. The action of the gene in the 
whole animal is ascertained by studying animals that are mosaic or 
chimeric for mutant and wild-type tissue (Hotta and Benzer 1972). 
When a gene of interest has been identified at the molecular level, 
longitudinal cellular studies provide complementary information on 
the spatial and temporal patterns of that gene's actions. 

Cancer in a metazoan is itself a multicellular process. Dramatic 
progress is emerging from The Cancer Genome Adas (Vogelstein 
et al. 2013) and the Cancer Epigenome Project (Beck et al. 2012) to 
document mutations and epigenetic changes common to the neoplas- 
tic lineages of each cancer histotype. However, biologists recognize 
that a cancer is more than its genome. Beyond the epithelial lineage 
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of the carcinoma, stromal elements can extrinsically influence the 
biology of the tumor. Such elements include leukocytes (Nosho et al. 

2011) , mast cells (Khan et al. 2013), neutrophils (Sinnamon 
et al. 2008), macrophages and cancer-associated fibroblasts (Edin 
et al. 2012; Herrera et al. 2013), and endothelial cells and the mi- 
crovasculature of the tumor (Yekkala and Baudino 2007; Pitroda 
et al. 2012). Genes acting on the neoplastic lineage, intrinsically or 
extrinsically, to affect the cancer phenotype can be discovered using 
modifier genetics (Dietrich et al. 1993; Gould and Dove 1996; Cormier 
et al. 1997). Genome-wide discovery programs for polymorphic modi- 
fiers of cancer risk are being carried out both in human populations 
(Gabriel et al. 2002; Carvajal-Carmona et al. 2011; Peters et al. 2012; 
Cai et al. 2013) and in rodent models (Ewart-Toland et al. 2003; Elahi 
et al. 2009; Kwong and Dove 2009; Crist et al. 2011; Liu et al. 2011; 
Smits et al. 2011; Quan et al. 2011; Eversley et al. 2012; Nnadi et al. 

2012) . However, identifying the causative elements underlying a poly- 
morphic quantitative risk modifier is a Herculean task (Drinkwater 
and Gould 2012; e.g., see Lewis and Tomlinson 2012). 

An important consideration is that the inbred strains of mice or 
rats that provide the starting material for modifier screens must carry 
functional alleles in all vital genes. Thus, the informative loss-of- 
function alleles of vital genes are absent from a modifier screen that 
emerges from an inbred line. 

These considerations have driven us to design a strategy for the 
genome-wide discovery of mutagen-induced alleles that quantitatively 
modify the tumor-inducing phenotype of the dominant Multiple in- 
testinal neoplasia (Min) allele of the Adenomatous polyposis coli (Ape) 
gene of the mouse (Moser et al. 1990). The mutagen JV-ethyl-N-nitro- 
sourea (ENU) predominantiy causes single point mutations in the 
rodent germline. This simplicity has enabled facile identification of 
a growing set of genes, identified as ENU-induced mutant alleles (Su 
et al. 1992; King et al. 1997; Garcia-Garcia et al. 2005; Van Boxtel et al. 
2010). ENU mutagenesis produces a range of point mutations 
(Takahasi et al. 2007). The frequency of such mutations in the sperm 
of an efficiently mutagenized mouse or rat is in the order of 10 -3 per 
locus. A "one-hit library" has a size expected to contain, on average, 
one mutant for each locus. Thus, a one-hit mutational library of 
first-generation offspring of a mutagenized male has a size in the 
order of 1000 founders. By contrast, transposon-mediated mutagen- 
esis of the mouse germline is significantly less efficient and does not 
yield the range of alleles produced by ENU. 

Mutations causing a qualitative phenotype in monogenic traits can 
be identified without first being mapped, using identity-by- descent in 
genome- wide sequencing data (Arnold et al. 2011; Hilton et al. 2011). 
For example, two-generation screens can be used to identify new 
alleles of known genes (Shedlovsky et al. 1993) or new loci (Kumar 
et al. 2011). By contrast, individual components of a highly complex 
trait do not show qualitative patterns of inheritance. To detect a mu- 
tant allele that quantitatively modifies a phenotype of interest, a cluster 
of progeny must be phenotyped. Extensive progeny testing becomes 
feasible if the kindred's founder is a male whose germline is cryopre- 
served to permit extensive retesting. Thus, quantitative modifying 
alleles for these traits are identified by low-resolution mapping fol- 
lowed by targeted sequence analysis to identify the causative mutation. 
Such mapping requires crossing the candidate founder to a partner 
carrying distinct marker alleles distributed over the genome. However, 
outcrossing to another inbred strain may introduce polymorphic 
modifiers from the donor mapping strain that can obfuscate the 
quantitative phenotype of interest (Elahi et al. 2009; Crist et al. 
2011; Liu et al. 2011; Eversley et al. 2012; Nnadi et al. 2012). There- 
fore, we have employed germline mutagenesis, followed by inbreeding, 



to create a set of "isogenic mapping partners" for the canonical 
C57BL/6J (B6) mouse strain. To begin, one of these lines, B6.SNVg, 
has been shown by progeny testing to be phenotypically neutral in 
respect to the Apc Mm phenotype. Resequencing of this line has estab- 
lished a set of induced single-nucleotide variant (SNV) markers cov- 
ering the genome at sufficient density to make B6.SNVg a suitable 
partner for mapping a new ENU-induced dominant modifier of the 
Apc Mm phenotype on the B6 background. 

MATERIALS AND METHODS 
Mice 

Mice were maintained under a protocol approved by the Animal Care 
and Use Committee of the University of Wisconsin School of Medicine 
and Public Health and in a facility in the McArdle Laboratory approved 
by the American Association of Laboratory Animal Care. Animals 
were housed in standard caging with free access to mouse chow and 
acidified water. 

BTBR/Pas (BTBR) mice, initially obtained from J-L. Guenet at the 
Institut Pasteur in Paris, France, were inbred in our laboratory. 

C57BL/6J-A^)c M " , (B6-Apc Min ) mice were obtained from the 
McArdle colony. The strain was maintained by backcrossing every 
fifth generation to C57BL6/J (B6) mice from The Jackson Laboratory 
(Bar Harbor, ME). Progeny from B6-Apc Mm/+ mothers were fostered 
to ICR foster mothers to enhance their probability of survival. 

Along with others, we have observed that dominant modifiers of the 
Apc Mln phenotype arise spontaneously in the B6 stock (Baran et al. 2007). 
Therefore, we now maintain a closed colony, C57BL/6JD-Apc Mm (D for 
Dove), and monitor it regularly for tumor multiplicity to prevent in- 
trogression of modifying alleles that can arise through genetic drift in the 
B6 stock. Through monitoring, the average tumor multiplicity in the 
closed colony is kept at approximately 100 ± 30 for mice at 90— 120 d 
of age. 

Lines B6.SNVb, B6.SNVc, B6.SNVe, B6.SNVf, B6.SNVg and B6.SNVh 
were developed using ENU mutagenesis of B6 male founders, followed by 
at least 10 generations of inbreeding. They carry single nucleotide variant 
sets b, c, e, f, g, and h, respectively. 

ENU mutagenesis 

BTBR males were mutagenized with 250 mg of ENU per kg body 
weight using the protocol previously described (Shedlovsky et al. 
1986). Because B6 males are permanently sterilized at 250 mg of 
ENU per kg body, we mutagenized them with three weekly injections 
of 100 mg per kg body weight. 

Tumor scoring 

Each intestinal tract was removed, opened longitudinally, and laid out 
on bibulous paper. The samples were fixed in 10% formalin overnight, 
then transferred to 70% ethanol for long-term storage. Tumor counts 
were obtained for the entire small intestine and colon using a Nikon 
SMZ-U dissecting microscope. 

Sequencing 

Paired-end 75-bp reads were generated by end-sequencing DNA 
sheared to an average of 350 bp. Sequencing was performed on the 
Alumina GAIIx platform to an average depth of 5.7 times genome 
coverage (Supporting Information, Table SI). These data were map- 
ped to the mm9/NCBIm37 assembly using BWA v0.5.9 (Li and 
Durbin 2009). Variants were called using Samtools (vO.1.17) mpi- 
leup/bcftools and then filtered selecting those variants for which the read 
depth was 3 or greater. Major changes in copy number or sequencing 
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efficiency were culled by filtering out signals stronger than 2.5 times the 
sequencing coverage of the sample. Data are shown in File SI. 

Sequenom validation 

In selecting candidates from line B6.SNVg for Sequenom validation, 
the goal was to find three candidates on each of the autosomes — near 
the centromere, middle, and telomere — fitting the following criteria: 

• Phred scale genotyping quality score of 99. 

• No significant repeats in flanking sequence. 

• Appearing in both B6.SNVg and B6.SNVh (known to be related), 
but in no other line. 

• A>TorT>A transversions were preferred, followed by A > G 
or T > C and G > A or C > T transitions. ENU mutagenesis 
enhances the proportion of AT to TA mutations, comprising 
18.7% of the unique mutations in this study, in contrast to 9% 
of spontaneous mutations (14849300 of 164137670 B6.SNVs 
across 18 strains) (Keane et al. 2011). 

Beyond the Phred scale quality rating of 99, the aforementioned 
criteria had to be relaxed in regions where mutations were sparse. 
Sequenom genotyping was performed as previously described (Gabriel 
et al. 2009). Chosen candidate variants in line B6.SNVg are validated 
in this communication; candidates in the other lines remain to be 
validated. 

RESULTS 

Outcross design with clustered progeny testing 

Our initial strategy to discover ENU-induced alleles that quantitatively 
modify the Apc Mm phenotype involved outcrossing a first-generation 
carrier of a mutagenized BTBR paternal genome to a B6-Apc Mm/+ 
tester. BTBR carries the same sensitive allele as B6 at the known 
modifier Moml locus (Gould et al. 1996), leading us to choose this 
partner to B6-Apc Mm in the outcross design. BTBR males, mutagen- 
ized as described in the section Materials and Methods, were bred to 
normal BTBR females to produce first-generation (Gl) males that 
were potential carriers of Apc Mm modifiers. These were first screened 
for such modifiers based upon the life span of their Fl(B6-Apc Mm/+ X 
Gl) offspring. Figure 1A shows the breeding designs used. 

The presenting phenotype in this initial foray was that of survival. 
Of 172 kindreds sired by Gl males, our attention was drawn to four 
kindreds carrying modifiers of the Apc Mm survival phenotype: two 
putative dominant suppressors (258 and 201) and two putative dom- 
inant enhancers (333 and 415). Each of these kindreds was identified 
for further study by observing more than one animal with an extreme 
lifespan - long for the putative dominant suppressors and short for 
the putative dominant enhancers. Figure 2 displays the survival of 
control Fl (B6-Apc Min/+ X BTBR) animals compared with Fls from 
all mutagenized males, as well as the survival of progeny within the 
aforementioned four kindreds. 

Differences in the lifespan distribution between unaffected Fl 
animals and potentially affected G1F1 animals reflect both the rate 
and quantitative effect of ENU-induced modifiers. A statistical 
analysis of these differences considers the distribution of mutagenized 
G1F1 Min animals as a mixture of three components (unaffected, 
enhanced, suppressed), and uses the maximum likelihood method to 
estimate both the rate at which ENU induces a phenotypic modifier 
and the distribution of multiplicative modifier effects (Figure 3 and 
File S2). From this calculation, we estimate that 39% of the mutagen- 
ized kindreds are free of dominant modifier alleles, 41% carry dom- 
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Figure 1 Breeding schemes to detect mutagen-induced dominant 
quantitative modifiers of the Apc Min phenotype. Three different 
designs are shown. The term "cluster" indicates that multiple progeny 
were screened to determine whether the kindred differed significantly 
from the mean. (A) Outcross modifier screen of BTBR G1 males based 
first upon survival and subsequently upon tumor multiplicity. (B) Iso- 
genic modifier screen of B6 mutagenized males, coupled with the 
ability to save sperm from candidate animals. (C) Design for producing 
inbred B6.SNV lines from mutagenized B6 males. Note that inbreed- 
ing eliminates recessive lethal or detrimental mutations. The existing 
B6.SNV lines have been inbred for at least 10 generations. The re- 
sidual heterozygosity expected at F10 is 14% (Green 1981). 

inant suppressor alleles that lead to reduced tumor multiplicity and 
longer lifespan, and 20% carry dominant enhancer alleles that have 
the opposite phenotype. Importantly, though a high proportion (61%) 
of mutagenized kindreds appears to carry modifiers of Apc Mm , it is 
evident that most are of small effect 5 (Kwong and Dove 2009). Equa- 
tion (2) in File S2 gives a formula for the enrichment in large modifier 
effects that can be achieved by selecting kindreds having multiple 
animals with extreme phenotype, as was done with kindreds 201, 
258, 333, and 415 (Figure S3 and Figure S4). 

In practice, the quantitative character of this screen for dominant 
modifiers of the Apc Mm phenotype was first analyzed on the basis of 
a simplifying assumption: that the time of survival of a Apc Mm/+ 
animal was correlated with its tumor multiplicity. The four candidate 



5 "When one tugs at a single thing in nature, he finds it attached to the rest of the world."— 
John Muir 
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Figure 2 Candidate kindreds of dominant modifiers of Apc M,n identi- 
fied by survival but not successfully mapped. The lower panel shows the 
survival curve of F1(B6-Apc M,n X BTBR) Apd JIW+ controls (gray line) and 
of Apc M,n/+ progeny from mutagenized BTBR outcrossed to B6-Apc M ' n/+ 
females (G1F1s, black line). The upper panel show the survival times of 
Apc Min/+ progeny from four different mutagenized BTBR G1 males (each 
the founder of a kindred). The four kindreds shown were chosen from 
172 test kindreds that generated clusters of Apc Min/+ progeny with ex- 
treme short or long lifespans. 



kindreds, first identified by survival, were further analyzed by tumor 
multiplicity of test progeny. The requirement for extensive progeny 
testing on the basis of both survival and tumor multiplicity led to an 
elapsed decision time that exceeded the lifespan of the founding first- 
generation mice. To implement this strategy, the germlines of these 
founders need to be maintained through sperm cryopreservation. 
However, recovery of BTBR/Pas from cryopreserved sperm remains 
elusive, while success with B6 sperm has only recently been signifi- 
cantiy improved (Takeo and Nakagata 2011) and confirmed in our 
group by K.J.K. 

Isogenic design 

Our attempts to map the presumptive dominant enhancer in kindred 
333 and suppressor in kindred 258 were unsuccessful owing to 
modifying alleles polymorphic between BTBR/Pas and B6, such as 
Mom7 (Kwong et al 2007). Indeed, 18 such polymorphic modifier loci 
of the Apc Mm tumor multiplicity phenotype have been reported to 
date (Eversley et al. 2012; Nnadi et al. 2012). The requirement for 
quantitative precision in this genetic strategy led us to move beyond 
the initial, informative outcross design and to develop an "isogenic 
design" (Figure IB). This method requires a set of "isogenic mapping 
partners" for B6 that carry markers throughout the genome. To de- 
velop these partners we mutagenized several B6 males as described in 
the Materials and Methods and bred each to normal B6 females 
(Figure 1C). Progeny of these crosses were randomly selected to give 
seven brother by sister pairs to intercross for at least 10 generations. 
One line (B6.SNVd) failed to achieve 10 generations of inbreeding, 
presumably owing to homozygosity for a recessive lethal mutation. 
The other six lines achieved 10 or more generations of inbreeding 
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Figure 3 A statistical analysis of the differences in lifespan considers 
the distribution of mutagenized G1F1 Min animals as a mixture of 
three components (unaffected, long-lived, short-lived), and uses the 
maximum likelihood method to estimate both the rate at which ENU 
induces a phenotypic modifier and the distribution of multiplicative 
modifier effects (File S2). From this calculation, we estimated that 39% 
of the mutagenized kindreds are free of dominant modifier alleles, 
41% carry dominant suppressor alleles that lead to reduced tumor 
multiplicity and longer lifespan, and 20% carry dominant enhancer 
alleles that have the opposite phenotype. A high proportion (61%) 
of mutagenized kindreds appear to carry modifiers of Apc Min , but most 
are of small effect. Equation (2) in File S2 gives the formula to estimate 
the enrichment in large modifier effects that can be achieved by 
selecting kindreds having multiple animals with an extreme survival 
phenotype. 



without evident homozygosity for a recessive detrimental or lethal 
mutation. These B6.SNV lines, each with a set of SNVs came from 
separate lineages with the exception of two lines (B6.SNVg and h) that 
had a common first- generation parent. 

Progeny test of the B6.SNVg line for dominant 
modification of the Apc Min phenotype 

The B6.SNVg line was tested for dominant modifiers of the Apc Mm 
phenotype in crosses to B6-Apc Mm . Females of the B6.SNVg line were 
bred to B6-Apc Mm animals from the McArdle Laboratory stock. 
(B6.SNVg X B6- Apc Min ) and Apc Minl+ progeny were killed at 90-100 
d of age for tumor scoring. In parallel, tumor counts were established 
from (B6 X B6-Apc Min/+ )Vl Apc^ inl+ control progeny, sired by the 
same males (Figure 4). No significant difference was observed between 
the two groups (P = 0.89 by two-sided Wilcoxon Rank Sum test). 

Estimation of the size of the progeny test needed 
to detect a carrier of a mutant modifier of Min 

The number of progeny needed to detect a Generation 1 carrier of an 
ENU-induced modifier at power 0.9 was estimated from the standard, 
overdistributed tumor multiplicity phenotype of B6-Apc Mml+ (File S2). 
In the first iteration of this assessment, we have used tumor multi- 
plicity alone to score the phenotype. In a second iteration, we have 
first preselected kindreds that are more likely to contain modifiers on 
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B6 x B6-Apc**" A B6-SNVg x B6-Apc M '" A 
Cross 

Figure 4 Progeny testing indicates that the B6.SNVg line carries no 
dominant modifiers of the Apc M,n/+ phenotype. B6-Apc Min/+ males 
were each mated both to B6 females and B6.SNVg females. Tumors 
counts of Apc M ' n/+ progeny averaged 75 ± 40 for B6 x B6-Apc Min/+ 
and 79 ± 23 for B6.SNVg x B6-Apc M,n/+ (P = 0.89, two-sided Wilcoxon 
rank sum test). The analysis has been limited to comparing two sets of 
progeny paired by B6-Apc M ' n/+ sires in common, because the tumor 
multiplicities of our B6-Apc M,n/+ colony varied significantly over time 
until the closed colony C57BL/6JD was established. 

the basis of shortened or lengthened survival, as summarized above. In 
each case, we have used a negative binomial model similar to that 
described more fully in the legend to Figure 5. We ask that a reported 
list of kindreds, controlled at 5% false discovery rate, be nonempty 
with at least 95% probability. The dimensions of this progeny test at 
power 0.9 for specific magnitudes of dominant effect of an ENU- 
induced modifier are shown in Figure SI. 

In the absence of preselection, for example on the basis of survival 
times, a prohibitive number of kindreds must be screened to the 
following depths. As shown in Figure SI A, to be 95% confident of 
discovering at least one allele with a twofold enhancing effect, 350 
kindreds must be screened. For such a dominant suppressor allele, 500 
kindreds would be needed. 

These estimates indicate that a realistic screen for dominant 
enhancer and suppressor alleles of the overdistributed Min phenotype 
must involve either preselection of kindreds of interest or sharpening the 
Apc Mm phenotype (Newton and Hastie 2006). The estimated distribu- 
tion of the frequencies and fold effect of dominant enhancer and sup- 
pressor alleles in the population of Generation 1 founders (Figure 3) and 
the necessary sizes of test progeny clusters (Figure 5) lead in this case to 
the estimates summarized in Figure SIB. For example, to detect at least 
one dominant enhancer allele with a twofold effect, approximately 13 
short-lived kindreds must be tested, each with 20 Apc M,n/+ progeny. The 
corresponding estimate to detect at least one dominant suppressor allele 
is 14 long-lived kindreds, each tested with 30 Apc Mm/+ progeny. 

Sequencing of the B6.SNV lines 

Genomes of six of the B6.SNV lines were sequenced as described in 
Materials and Methods. We identified across all 6 B6.SNV fines a total 
of 22,911 high confidence candidate SNV positions from the sequenc- 
ing data by comparison to the B6 (mm9/NCBIm37) reference genome 
(Table SI). 

A B6.SNVg mapping panel of confirmed SNV sites 

Because progeny tests indicated that B6.SNVg does not dominantly 
affect the tumor multiplicity phenotype of B6-Apc Mm (Figure 4), we 
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Figure 5 The number of progeny is estimated needed to test fully 
a mutagenized paternal gamete carried in a Generation 1 founder of 
a kindred. The null hypothesis tested is that the kindred carries no 
ENU-induced modifier. The tumor multiplicity data are assumed to 
follow a negative binomial distribution around a mean of 99.8 tumors 
and an overdispersion shape parameter of 9.8, as estimated from 
recent data from our closed colony, C57BL/6JD.Apc Mln . If a modifier is 
carried by a kindred in heterozygous form, it is assumed to segregate 
and to affect only the tumor multiplicity and its variance, but not the 
shape parameter. The number of progeny is shown on the x-axis re- 
quired to achieve sufficient power (on the y-axis) to detect at a 5% 
significance level a fold-effect given by the lines. The power calcula- 
tions are based on the approximation to a normal distribution of the 
average tumor multiplicities within the test kindred. 

have developed a mapping panel of SNV sites in this line. Thus, it can 
serve as the isogenic mapping partner to locate new ENU-induced 
dominant modifiers of the Apc Mm phenotype. 

A total of 180 SNV candidate marker sites were chosen for 
confirmation by Sequenom genotyping as described in the Materials 
and Methods with 148 entering the final assay. Validation was carried 
out on DNA samples from 31 C57BL/6JD, six B6.SNVg, one B6.SNVh, 
and 41 (C57BL/6JD x B6.SNVg)Fl mice. The results are displayed in 
File S2. Of the 148 candidate sites tested, 8 failed to give Sequenom 
results, so data are shown for 140 candidates. For 5 of these 140, the 
results may indicate either divergence between our C57BL/6JD and 
the canonical B6 used in sequencing NCBIm37, or else errors in that 
sequence. One candidate does not vary between B6.SNVg and 
C57BL/6JD, probably reflecting an error in the B6.SNVg sequence. 
Eleven of the candidates show evidence for residual heterozygosity 
in the B6.SNVg line, which is consistent with the expectation of 1 1% 
for 12 generations of inbreeding (Green 1981). This heterozygosity is 
not obligatory; the set of progeny includes B6.SNV/B6.SNV homo- 
zygotes at each of these eleven sites. Thus, 123 sites were validated as 
being ENU-derived and homozygous in B6.SNVg. 

The use of B6.SNVg as a mapping partner 

Figure 6 presents the set of B6.SNVg sites that can serve the purpose 
of mapping a newly induced dominant modifier allele on B6. The 
meiotic maps of mammalian genomes show strong positive crossover 
interference (Broman et al. 2002). Thus, one expects that this mapping 
panel will permit a new mutation to be mapped to a defined region of 
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the B6 genome. Targeted resequencing of this region in DNA from 
carriers of the newly induced modifier is then expected to yield one, or 
at most a small number, of candidate single base pair mutations; these 
can be confirmed by allelic substitution by the CRISP/Cas9 process 
(Wang et al 2013). 

The frequency and pattern of ENU-induced mutations 

Table SI summarizes the resequencing analysis of the six B6.SNV 
lines. The frequency of presumptive mutations per base pair reported 
from the canonical sequence of B6 (NCBIm37/mm9) was 1.3 x 10 4 in 
6 x 2 x 10 9 base pairs covered = 1.1 x 10~ 6 per base pair. This 
frequency is consistent with those previously reported (Michaud 
et al. 2005; Takahasi et al. 2007). Table 1 reports the numbers of 
line-specific mutations on each chromosome; Table 2 summarizes 
the spectrum of the unique mutations. Transitions from AT to GC 
and GC to AT ranked 1st and 2nd, whereas transversions between AT 
and TA ranked a close 3rd. This spectrum is similar to those reported 
in the study of Takahasi et al. (2007) and in the review by Barbaric 
et al. (2007). 

The distribution across the genome of the ENU-induced 
alleles in the SNV lines 

Are regions of the genome of a SNV line devoid of markers, owing to 
homozygous lethality of mutations? The complete set of line-specific 
SNVs for the full set of B6.SNV lines is displayed in Figure S2. Full 
coverage of the genome, with some heterozygosity, is observed in three 
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Figure 6 Location of point mutations in B6.SNVg that were validated by 
Sequenom testing. A total of 180 SNV candidate sites were chosen for 
confirmation by Sequenom genotyping, with 140 giving results in the final 
Sequenom assay. Shown are the 123 sites that are homozygous in 
B6.SNVg and the 1 1 sites showing residual heterozygosity. Validation was 
carried out on DNA samples from C57BL/6JD, B6.SNVg, B6.SNVh, and 
(C57BL/6JD x B6.SNVg)F1 mice (see File S4). The map displays the set of 
validated SNV sites that can serve the purpose of mapping to low reso- 
lution a newly induced dominant modifier allele by crosses of the B6 
carrier of the new modifier to B6.SNVg. 



of the six lines— B6.SNVb, B6.SNVe, and B6.SNVg. At least one gap of 
over 30 Mpb is observed in each of the other lines. B6.SNVc is devoid of 
markers in two regions: Chr 7 at 100-144 Mbp; and Chr 11: 28-58 
Mbp. B6.SNVf (whose resequencing coverage was only 1.7-fold) is de- 
void of markers in seven regions: Chr 5 at 43 — 77 Mbp; Chr 6 at 68—99 
Mbp, Chr 11 at 89- 121 Mbp, Chr. 12 at 26-67 Mbp; Chr 14 at 60-89 
Mbp, Chr 16 at 43-77 Mbp; and Chr 1 at: 22-54 Mbp. B6.SNVh is 
devoid of markers on Chr 15 at 22 — 52 Mbp. Thus, in general the 
densely mutagenized genomes of the six B6.SNV lines support homo- 
zygosity, without loss of markers from particular regions. 

DISCUSSION 

We have developed a general genome-wide strategy to identify ENU- 
induced mutations that dominantly modify the Apc Mm phenotype. In 
important ways this strategy complements others that are being ac- 
tively pursued for the analysis of complex traits in mammalian biology 
such as cancer. The Genome-Wide Association Studies (GWAS, 
http://gwas.nih.gov/) and the Collaborative Cross (CC, http://csbio. 
unc.edu/CCstatus/index.py) reveal germline haplotypes that contrib- 
ute to disease risk in the human and mouse population, respectively. 
By contrast, an allelic series of ENU-induced point mutations (see 
Table 2 in Takahasi et al. 2007) enables functional analysis of each 
gene affecting the trait of interest. Although loss-of- function alleles are 
particularly informative for cause-effect analysis, these alleles cannot 
be present among vital genes in the inbred mouse strains that generate 
the CC. The cancer genome and epigenome are addressed by The 
Cancer Genome Atlas (TCGA, http://cancergenome.nih.gov/) and the 
Cancer Epigenome Project (Beck et al. 2012). These projects identify 
mutations and other somatically heritable changes that act directly 
within the cancer lineage but not in the stroma. By contrast, modify- 
ing alleles can act in the tumor lineage, the stroma, or the host at large. 

The detection of a subtle phenotype caused by a mutation in a single 
gene of a complex system is enhanced by amplifying the number of 
progeny displaying the mutant phenotype. Here, we describe a statistical 
approach to detect kindreds carrying dominant alleles that modify the 
tumor multiplicity phenotype. We have modeled briefly the null and 
modified distribution of the average tumor count within each kindred 
and calibrated this statistic, accounting for background phenotypic 
variation and modifier segregation. The power to detect a mutant signal 
is a function of the number of test progeny. Thus, the ability to recover 
a B6 line from cryopreserved sperm, to amplify a kindred under isogenic 
conditions, is a crucial element in implementing this quantitative 
modifier screen. 

The advent of cost-effective sequencing of mammalian genomes 
led to the association (by identity-by-descent) of mutant phenotypes 
with particular segregating ENU-induced mutations. However, for 
genetically highly complex traits, it is likely to be necessary to delimit 
the genomic region carrying the new mutation of interest for targeted 
resequencing. The known point mutations preferentially caused by 
ENU simplify both the molecular identification of the gene of interest 
and the analysis of its mechanism of action. However, the distribution 
of distances found between adjacent candidate SNV mutations provides 
a measure of caution against the assumption that a single mutation is 
responsible for the mutant phenotype. Indeed, SNP sites in the human 
genome show evidence for nonrandom clustering (Amos 2010). We 
observe that the median distance (50th percentile) between in the 
SNV sites is 377,740 bp and the interquartile range is 48,852- 
1,115,595 (25th to 75th percentile) (Table S2). Thus, it is likely that 
a new modifier allele can be traced to a single mutation, but it will 
ultimately be necessary to validate the candidate by methods such as 
genome editing (Wang et al. 2013). 
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Table 1 Distribution of line-specific candidate variants by chromosome and line 



No. Candidate Variants 





Only B6.SNVb 


Only B6.SNVc 


Only B6.SNVe 


Only B6.SNVf 


Only B6.SNVg 


Only B6.SNVh 


Only B6.SNVg 


1 


300 


300 


89 


108 


52 


81 


9 


2 


206 


232 


51 


53 


121 


78 


19 


3 


244 


198 


144 


56 


47 


263 


54 


4 


348 


215 


254 


174 


36 


105 


21 


5 


95 


246 


110 


104 


82 


218 


46 


6 


175 


108 


54 


49 


96 


56 


49 


7 


187 


57 


127 


57 


46 


236 


102 


8 


179 


84 


78 


46 


81 


138 


92 


9 


132 


63 


108 


33 


106 


42 


12 


10 


114 


216 


115 


62 


79 


127 


73 


11 


159 


71 


76 


34 


75 


37 


5 


12 


149 


121 


119 


43 


34 


122 


10 


13 


120 


185 


57 


40 


31 


104 


70 


14 


114 


103 


158 


26 


36 


221 


23 


15 


92 


69 


117 


59 


29 


67 


15 


16 


163 


57 


42 


35 


43 


160 


98 


17 


138 


180 


85 


97 


47 


113 


54 


18 


92 


111 


39 


48 


61 


136 


7 


19 


54 


89 


75 


6 


36 


51 


11 


X 


185 


227 


153 


68 


87 


165 


22 


Total 


3246 


2932 


2051 


1198 


1225 


2520 


792 



In addition to the 13,172 candidate variants each found only in a single line, 792 other candidate variants are each found in lines B6.SNVg and B6.SNVh but in no 
other lines. B6.SNVg and B6.SNVh share founders, but the other lines are independent. Chr, chromosome. 



The evolved strategy discovers modifying genes on the basis of the 
heterozygous phenotype of an ENU-induced mutation. In what ways 
does this restriction limit the genome-wide coverage to discover the full 
set of genes that play a causative role in the biological process of interest? 
Application of mutagen-induced dominant modifier genetics broadens 
the genetic analysis by discovering loss-of-function alleles of vital genes 
that must be absent from inbred populations. Although such an allele 
can be maintained in heterozygous form, will the heterozygote have 
a phenotype? Alleles at quantitative trait loci commonly show additive 
inheritance. Many genes that modify a tumor pathway are haploinsuf- 
ficient as judged by the quantitative phenotype of heterozygotes for 
knockout alleles. Examples include but are not limited to Mmp7 (Wilson 
et al. 1997), T-cell factor 4 (Angus-Hill et al. 201 1), Kruppel-like factors 4 
(Ghaleb et al. 2007) and 5 (McConnell et al. 2009), Park2 (Poulogiannis 
et al. 2010), E3 ubiquitin ligase Fbw7 (Sancho et al. 2010), SGOl 
(Yamada et al. 2012), cMyc (Yekkala and Baudino 2007), and SMAD4 
(Szigeti et al. 2012). The quantitative nature of tumor suppression has 
been reviewed by Largaespada (2001) and Payne and Kemp (2005). 
Berger et al. (2011) have generalized the dosage sensitivity hypothesis 
for tumor suppressor genes. Loss-of-function alleles are maximally in- 
formative in the analysis of chains of causation. As discussed previously, 
for such alleles, the evolved strategy is biased toward cases of dominant 
alleles involving haploinsufficiency. This bias may favor the identifica- 
tion of dosage- sensitive targets of drug action. 

B6.SNVg has been shown to be a phenotypically neutral isogenic 
partner for mapping new modifiers of B6-Apc Mm . All six B6.SNV lines 
are available from the McArdle Laboratory for Cancer Research at the 
University of Wisconsin to be developed for research purposes as 
candidate mapping partners for the modifier genetics of the quanti- 
tative phenotype displayed by other B6 strains carrying a dominant 
element such as a transgene that affects a biological pathway of in- 
terest. The sequence changes detected by Illumina resequencing of 
these strains are shown in File SI. Only the calls in strain B6.SNVg 
were further tested by Sequenom analysis (File S3). 



Other derivatives of B6 that have acquired new mutations by 
neutral drift or by strain admixture, such C57BL/6N, may also serve as 
isogenic mapping partners if shown to be phenotypically neutral 
(Simon et al. 2013). 

We can anticipate significant future enhancements in the modifier 
genetics of complex traits in experimental mammals. On the phenotypic 
side, we can expect advances in the molecular and cellular dissection 
of tumor pathways. The complexity of cancer as a genetic trait surely 
includes the biological complexity of the disease. Subdividing the 
phenotype enhances the statistical power to detect a modifying 
mutation through its selective action on one component of such 
a complex system (see March et al. 2011; Green et al. 2011).This 
strategy has been demonstrated for circadian biology by Takahashi 
and his colleagues (Chen et al. 2012). On the genetic side, we can 
expect further improvements by a new generation of mapping part- 
ners: using the efficient one-step allelic substitution process (Wang 

Table 2 Mutational pathway of the 1 3,1 72 line-specific candidate 
variants 



Line-Specific Candidate Variants 



Mutation type 


Ref 


Alt 


N 


Percent 


Transition 


A 


G 


2142 


16 




T 


C 


2497 


19 




C 


T 


1540 


12 




G 


A 


1435 


11 


Transversion 


A 


C 


516 


4 




T 


G 


598 


5 




C 


A 


683 


5 




G 


T 


649 


5 




A 


T 


1114 


8 




T 


A 


1352 


10 




C 


G 


332 


3 




G 


C 


314 


2 
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et al. 2013), a geneticist can engineer an isogenic mapping partner of 
an inbred strain by introducing a planned set of SNV mapping sites 
genome-wide, designed to be functionally neutral. 6 

A genome-wide genetic screen for mutagen-induced quantitative 
modifiers of the complex Apc Mm tumor phenotype can expand in 
important directions our molecular and cellular understanding of 
the complexities of intestinal neoplasia. This report illustrates the 
obstacles that must be overcome to achieve this end, and documents 
the creation of isogenic mapping partners for the canonical mouse 
strain C57BL/6J by which such sensitive genetic screens can be de- 
veloped. This strategy and its sequelae can complement other active 
programs for the analysis of complex biological processes. 
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